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PREFACE 


In the academic year 1927-28 I gave the gradu- 
ate course in Statistical Theory and Analysis at 
Harvard University. The point of view of sta- 
tistics which I emphasized in that course was that 
the subject is not a collection of numerical facts, 
but rather a method and a logic; that the science 
of statistics is an applied science in which it is 
just as necessary to scrutinize the premises as 
the logic and mathematics; that mechanistic 
and universalistic notions of statistics are mis- 
taken; that statistical methods are frequently 
dependent upon the peculiar data to which 
they are applied; that such methods are sound 
when they can be retained in actual practice, even 
though the methods are not applicable to data of 
another sort; and that economic statistics has 
the same relation to economics and mathematics 
as mathematical physics has to physics and 
mathematics. The construction of index numbers 
was discussed from the point of view just indi- 
cated, that is, as a problem necessarily involving 
not only economics and mathematics, but also 
the nature of the data utilized. The discussion 
referred to is presented in the following pages. 
WARREN M. PERSONS 


August 15, 1928. 
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THE CONSTRUCTION OF 
INDEX NUMBERS 


F the price (p,) of any commodity, such as 
wheat, is $1.25 per bushel in period 1 and the 
price (2) in period 2 is $1.50 per bushel, the 
relative price in period 2 as compared with 
period 1 (f2/p1) is $1.50+$1.25 =1.20=120 per 
cent. The relative price in period 1 as compared 
with period 2 (p:/p2) is $1.25 +$1.50=0.83%= 
8314 per cent. The product of the two price 
relatives computed forward and backward (2/1 . 
pi/p2), obviously, always equals 1. 

If the prices (p’: and ’2) of a second com- 
modity, such as pig iron, are $25 per ton and 
$20 per ton in periods 1 and 2, respectively, the 
two price relatives, computed forward and back- 
ward, will be 80 per cent and 125 per cent. 

If our object be the comparison of the actual 
price change of wheat per unit of quantity with 
that of pig iron per unit of quantity, between two 
periods, the units of quantity which we use for 
the two commodities, whether pounds, bushels 
or tons, will, of course, determine the size of 
the actual increase or decrease. That is, the 
units of quantity selected determine the magni- 
tude of the increments of prices of the two 
commodities. 

If we are interested in the comparison of the 
relative price change of wheat with that of pig 
iron, during the interval between periods 1 and 
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2, and take the ratios of actual prices instead 
of the differences between them, the units of 
quantity used for the quotation of prices of the 
two commodities do not affect the result. That 
is, the price relatives for two or more commod- 
ities are independent of the units of quantity 
used for the several commodities and, being 
expressed in a common unit (percentage), are 
prima facie comparable. 

The object of the present study is that of 
measuring the average change of prices of two 
or more commodities during two or more periods. 
In other words, our problem is that of averaging 
a number of price relatives. 

Starting with the basic concept of the relative 
price change of a single commodity during one 
interval, we then compute price relatives for a 
number of commodities, intervals, and bases and 
observe that in actual practice whatever arbi- 
trary period we use as the base, we obtain 
magnitudes differing substantially from each 
other for the several commodities. The re- 
sults for special cases indicate two empirical 
generalizations: 

(a) The magnitude of the price relatives 
depends upon the base which we arbitrarily 
select. 

(0) The distribution of the price relatives 
with respect to each other depends upon the 
base which we arbitrarily select. ° 

These generalizations at once suggest a ques- 
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tion. If we asswme an arbitrary base, is there a 
plane or level of distribution of price relatives 
—constant for a given period — which is 
superior to other magnitudes as a concept to be 
retained in actual practice? This is the central 
problem of averages. 

“ Averaging”’ magnitudes is associated with 
familiar mathematical concepts and processes 
and, in order to obtain an average or summary 
figure which is a useful working concept, we 
must utilize these familiar mathematical proc- 
esses. The process may be that of summation, 
leading to the arithmetic and harmonic means, 
or of multiplication, leading to the geometric 
mean, or of arranging the items in order of mag- 
nitude, leading to the median, or of finding the 
item which appears most frequently, leading to 
the mode. In any case, that method of averag- 
ing is to be preferred which is not only mathe- 
matically sound and self-consistent, but also is 
not too complicated to be readily understood 
and kept in mind. That is, it is essential that, 
as a working concept, the average be simple as 
well as accurate. Consequently, our discussion 
of averages will proceed from the familiar forms. 
Moreover, since we wish to obtain an average 
applicable to few commodities, ten or twenty, 
as well as to many commodities, fifty to a thou- 
sand, we shall discard the median and mode at the 
outset. The median and mode are unsatisfac- 
tory for the reason that they can be determined 
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only within wide limits unless the items are very 
numerous. Further, even for a large number of 
items, the median and mode have serious defects 
for our present purpose:! these two averages are 
influenced only slightly, if at all, by substantial 
changes among the relatives, and the mode may 
be indeterminate. Our discussion of averages 
starts, therefore, with the assumption that the 
arithmetic, harmonic, and geometric means are 
more appropriate for our purpose than are other 
and less familiar averages. 

The price relatives to be averaged are, as we 
have said, independent of the units of quantity 
with respect to which prices are expressed. This 
fact does not mean, however, that an average 
corresponding to a useful concept can be obtained 
without taking into account the aggregate 
quantities with which the relatives are asso- 
ciated. That is, in general we can not legiti- 
mately assume that it is possible to obtain a 
useful average of price relatives without weight- 
ing the relatives in proportion to quantities. 
Further, since the quantities of various com- 
modities are expressed in terms of incommensu- 
rable units — bushels, tons, yards, feet, number 
— and since, from the point of view of economics, 
the only unit which all quantities of all com- 


1 For other purposes (such as the measurement of seasonal 
variation) the median or mode may, of course, be more 
appropriate than the arithmetic, harmonic and geometric 
means. 
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modities have in common is value, it is neces- 
sary to use weights based upon values. 

The formulas of the weighted arithmetic (A), 
harmonic (H), and geometric (G) means for 
period m relative to period 1 (denoted by the 
subscripts), for m commodities and weights, 
indicated by W, are: 
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It will be noticed that the arithmetic and 
harmonic means are both based, in part, on the 
mathematical process of summation, and hence 
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are related to each other, while the geometric 
mean is based entirely on the mathematical 
process of multiplication. 

In the numerical illustration with which we 
started, the prices of wheat and pig iron in two 
periods, suppose that the quantities correlative 
to the prices of wheat are 10,000 and 10,000 
bushels (g: and gz) and of pig iron are 1000 and 
500 tons (g, and q), respectively, for the two 
periods. By combining these quantities with the 
corresponding prices (pg) we get four sets of 
weights based upon values, if we use the data of 
the two periods only, for the price relatives of 
each commodity in this binary comparison. 
These values and weights are: 


I I I Iv 

Commodity pig pigs pon poga 
IWikeaterrsteisieretts $12,500 $12,500 $15,000 $15,000 
Pig WO. ie oe os 8 25,000 12,500 20,000 10,000 
Aggregate....... 37,500 25,000 35,000 25,000 





pig /Zpign pig2/Zpig2 | poqi/Zpeqi | prq2/Xprg2 





Wheater aletats eis = 1g or .3316 1 or .50 $ or .428 % or .60 
Pig) Won cisco ae 24 or .6624 ¥% or .50 4 or .574 $ or .40 





The averages of the price of wheat and the 
price of pig iron computed forward (for period 2 
relative to period 1), weighted in four different 
ways, are: 
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Arithmetic mean.| }% or .0333 I or 1.0000} §% or .o714 | $% or 1.0400 
Geometric mean.. O15 979 -9518 1.0203 
Harmonic mean..} 5 of .9o0o | ¥¥ or .9600] }$ or .9333 I oF I.0000 





The averages of the price of wheat and the 
price of pig iron computed backward (for 
period 1 relative to period 2), weighted in the 
same four ways, are: 


I I III Iv 
Arithmetic mean .| 4? or r.1111 | $3 or 1.0417 | }§ or 1.0714] § or 1.0000 
Geometric mean.. 1.0920 1.0206 1.0506 .9801 


Harmonic mean. .} }§ or 1.0714] 1 or 1.0000 | §§ or 1.0294 | #§ or .g615 


Certain features of the averages in the pre- 
ceding tables are evident. First, when the base 
and type of average remain constant but dif- 
ferent weights are used the results differ sub- 
stantially from each other. Second, when the 
base and weights remain constant but different 
types of average are used, the arithmetic means 
are always larger than the geometric means and 
the latter are always larger than the harmonic 
means. Third, when the weights remain con- 
stant and the averages computed forward and 
backward are compared we find that: 

(2) The geometric means give consistent 
results, that is, the products of the geo- 
metric means with constant weights, com- 
puted forward and backward, always equal 
one. 


8 INDEX NUMBERS 


(b) The arithmetic means, however, give 
inconsistent results, that is, the product of 
the two arithmetic means with constant 
weights, computed forward and backward, 
are not reciprocals of each other, but their 
product is always greater than one. 

(c) The harmonic means, likewise, give 
inconsistent results, that is, the product of 
the two harmonic means with constant 
weights, computed forward and backward, 
are not reciprocals, but their product is 
always less than one. 

(d) If we take the geometric mean of 
the arithmetic and harmonic means with 
constant weights, computed forward, we get 
consistent results with the geometric mean 
of the arithmetic and harmonic means, with 
the same constant weights, computed back- 
ward, that is, their product always equals 
one. 

In attacking the problem of measuring the 
average change of the prices of two or more com- 
modities between two periods we confront at the 
outset two difficulties, illustrated by the example 
just considered. These difficulties are: 

(a) The selection of weights in cases for 
which, in actual practice, weights and 
relatives may or may not be correlated. , 

(b) The selection of a method of averag- 
ing which will be self-consistent, from the 
available methods which do not always give 
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reciprocal values when computed forward 
and backward. 

In the problem of averaging relative prices — 
the problem of index numbers — we assume that 
we have as our equipment, first, actual prices; 
second, corresponding quantities; third, price 
relatives; fourth, value weights, and fifth, the 
familiar averages—the arithmetic, harmonic 
and geometric means. 

We define an index number as an average of 
relatives, computed from actual economic data 
—a limitation which bars negatives, zeros, and 
infinites. We define an average as a function 
which has the properties that (a) if all the items 
averaged are equal to each other the average 
will be equal to each one of them also, and (0) if 
the items averaged are not all equal to each 
other the average will lie between the largest 
and smallest items. As we have said it is not 
our object to test every possible function having 
these two properties, but merely the averages 
corresponding to familiar mathematical processes 
and concepts — the arithmetic, geometric, and 
harmonic averages,’ all of which possess the 
general properties just specified. 


I. Tort PRoBLEM OF MEASURING THE AVERAGE 
. CHANGE OF PRICES BETWEEN Two PERIODS 


In searching for an answer to the question, 


1 The “‘aggregative”’ average is a special case of the arith- 
metic average. 
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“What is the average change of prices between 
two periods?” we encountered the following 
difficulties: 

(a) Uncertainty as to the direction and 
amount of bias introduced by weights con- 
structed from the pairs of products of the vari- 
ous prices and quantities for the two periods. 
One element in the weights, prices, always con- 
tributes to the positive or negative correlation 
of weights and price relatives; the other element, 
quantities, may or may not contribute to the 
correlation. In any case, the effect of the weights 
upon the resulting average of relative prices 
depends upon the correlation or lack of corre- 
lation between the weights, pg/Zpq and the 
relatives, p2/p:. The bias introduced on account 
of the correlation between weights and rela- 
tives to which we apply them will be called 
bias of the correlation of weights and associated 
relatives, or more shortly, weight correlation 
bias.} 

1T depart from Fisher in that I assume that we may in 
actual practice have any sort of correlation between prices 


and quantities. Thus, in the first edition of his text, Appendix 
I, page 384, he says: 


“In the text, systems I and II were summarily lumped 
together as practically the same, and likewise systems III 
and IV were lumped together. . . . Any index number under 
the system of weighting IIL must be larger than under weight- 
ing I. . . . In the ‘short’ step between III and IV, on the 
other hand, there is uncertainty. Any index number under 
III may be greater or less than under IV and may even 
possibly happen under very unusual circumstances to be much 
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(b) Inconsistent results for the arithmetic 
(and harmonic) ‘averages of price relatives with 
fixed weights when we shift the base of the rela- 
tives from one to the other of the pair of periods 
being compared (this inconsistency does not 
occur for the geometric mean, median, and 
mode). That is to say, bias will be introduced 


larger or smaller. It is a fair lottery. The high price relatives 
may draw either heavy weights or light weights with an even 
chance each way, as, likewise, may the low price relatives. 
The net effect will probably be an almost complete offsetting 
so that the final index number (IV) will probably be close to 
TII and may be either slightly above or below. 


According to my notion this begs the question. In actual 
practice both high positive and high negative correlation fre- 
quently occurs between prices and quantities. Fisher says 
(footnote, page 384), “It is, of course, conceivable that there 
is a correlation between the prices and quantities but this 
may be in either direction according as the prime mover is 
supply or demand.” All the work on statistical supply and 
demand curves of H. L. Moore and others indicates that in 
actual practice there is usually correlation between prices and 
quantities. Fisher continues, “Investigation of some New 
York Stock Exchange prices shows the same absence of corre- 
lation.” My investigations bring me to precisely the opposite 
conclusion — that is to say, when prices are moving upward 
rapidly, the largest volume of sales occurs in those stocks for 
which prices are skyrocketing. At other times, the largest 
volume of sales occurs in those stocks for which prices are 
falling most rapidly. 

In the case of the twelve crops for which I computed an 
index number of prices, Fisher necessarily admits the fact 
that there is an inverse correlation between quantity and 
prices. He adds that in this case “it is noteworthy that the 
effect on the curves given in Chapter XI is almost negligible.” 
It is negligible when pairs of years only are compared. For a 
series of years it actually results in a wider and wider divergence 
of the fixed base and chain indexes. 
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into an arithmetic or harmonic average whether 
one or the other base be selected. This bias will 
be called bias of the arbitrary base period, or 
more shortly, base bias. 

The difficulties on account of the bias of (a) 
weight correlation ane (b) base period were met 
as follows: 

In order to ascertain the effect of weights with 
certainty three special cases of prices and asso- 
ciated quantities were considered. These are: 

(1) The price -relatives, pe/p1, are as- 
sumed to be correlated with the associated 
weights, pg/Zpg, only through the price 
element appearing in each. 

(2) Prices and quantities are assumed to 
contribute equally in magnitude and direc- 
tion to the correlation between the price 
relatives, p2/pi, and their weights, pg/Zpq. 

(3) Prices and quantities are assumed to 
contribute equally in magnitude, but in 
opposite directions to the correlation be- 
tween the price relatives, p2/pi, and their 
weights, pg/2pq. 

For each of the three means (geometric, har- 
monic, and arithmetic) in each of the three 
special cases, a summary (Table 1) was made 
showing the number of equal contributions 
(up or down) to weight correlation bias by prices 
and quantities for each of the four combinations 


(pig, £192, P2gi, and pog2) appearing in the weights. 
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These four combinations are designated I, II, 
III and IV. 

It is obvious that it is possible to select pairs 
of geometric means from case (1) of Table 1 in 
which one member of each pair will have an 
upward weight correlation bias and the other 
member will have a downward weight correla- 
tion bias. These pairs, four in all, appear in the 
second column of Table 2. It is possible to 
select two such pairs of geometric means from 
case (2) and two from case (3) with opposite 
weight correlation bias. These pairs appear in 
the third and fourth columns of Table 2. Inspec- 
tion of the pairs of geometric means appearing 
for the three cases reveals the fact that two 
pairs (Ig and IVg) and (Ilg and IIIg) are com- 
mon to all. Consequently, we conclude that a 
geometric mean of relatives either with a com- 
bination of weights I and IV or of weights II 
and III would have no weight correlation bias 
whether the correlation between prices and 
quantities be perfect positive, zero, or perfect 
negative. Since these weights result in an aver- 
age with no weight correlation bias for the three 
pivotal cases of correlation between p and g it 
is likely that the same result would be achieved 
for the intermediate cases of correlation or a 
mixture of the three pivotal cases. Conse- 
quently, we conclude that these geometric 
averages, which we have numbered i and ii, are 
superior, for measuring the average change of 
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prices between two periods, to geometric aver- 
ages with weights which do not satisfy the 
condition. 

Further, since any geometric mean with fixed 
weights computed forward is the reciprocal of 
the geometric mean with fixed weights com- 
puted backward the base reversal test is satis- 
fied by the geometric averages i and ii. Any 
other average is considered to be logically infe- 
rior to these — for the purpose of measuring the 
average change of prices between two periods — 
if (a) it is not free from weight correlation bias 
and (b) it does not satisfy the base reversal 
test, z.e., the reversal of the base should result 
in reciprocal index numbers. That is, these 
geometric means, being free both from weight 
correlation bias and base bias, are logically 
superior to all averages not having these features. 

Other averages than the geometric mean can 
be found to satisfy both of the essential require- 
ments just specified. The arithmetic and har- 
monic means, however, as we have seen, do not 
satisfy the base reversal test. In fact, it is 
easily proven that the product of the arithmetic 
means computed forward and backward is al- 
ways greater than one and the product of the 
two harmonic means computed forward and 
backward is always less than one.! That is to 
say (using the notation defined on page 5): 


1 One assumption is implicit: the relatives averaged are not 
all equal. 
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Ja lool & bro 1 
But, it is also easily proven, assuming fixed 
weights, that! 
AD Ee 
(Ao.1°H2.1) (A j.2°H1.2) =) 


The equality just stated means that the prod- 
uct of the arithmetic mean and harmonic mean 
with the same weights in all means satisfies the 
base reversal test. Therefore, to obtain aver- 
ages (a) free from weight correlation bias and 
(b) which may satisfy the base reversal test we 
may take the square root of the product of an 
arithmetic mean and harmonic mean from Table 
1, choosing pairs of averages in which the weight 
correlation bias of one member of the pair will 
be opposite to that of the other member of the 
pair. The combinations of one arithmetic mean, 
and one harmonic mean, chosen in this fashion, 
are given for each of the three special cases in 
Table 3. There are three combinations, iii, iv 
and v, which are common to all three pivotal 
cases, and hence are free from weight correla- 
tion bias in these cases. Further, it is easily 
demonstrated that the combinations ili, iv and 


1 The proof follows: 


> le 
p > 
ge and His au 
= z (&)w 
Pi 


PRICES BETWEEN TWO PERIODS 19 


v satisfy the base reversal test. Therefore, the 
five combination averages of Tables 2 and 3 are 
(a) free from weight correlation bias and (0) 
satisfy the base reversal test. 

It will be noticed from Table 4 that formula 
v, the “ideal,” is identical with the geometric 
mean of (Ia, IIa, IlIz and IVh). That is to 
say, the “ideal” index of prices may be con- 
sidered to be the geometric mean of four constit- 
uent indexes of prices, two of these constituents 
being arithmetic méans of relatives, with 
weights figi/Zpigi and pig2/Zpige, respectively, 
and the other two constituents being harmonic 
mean of relatives with weights p.q:1/2poqi and 
p2q2/Zpq2, respectively. In still other words, 
the “ideal” index of prices, in a binary com- 
parison, is the result of using as weights for the 
relatives averaged all possible combinations of 
prices (p; and 2) with the associated quantities 
(g1 and qz) of the two periods. 

Combination formulas i to v are, however, 
not the only ones that can be obtained from 
Table 1 to satisfy our two tests. We may com- 
bine four geometric means, or two harmonic 
means with two arithmetic means, in such a 
way as to have offsetting weight bias for each 
of the three pivotal cases. There is, of course, 
only one combination of four geometric means, 
i.e. (Ig, IIg, IfIg and IVg), which satisfies the 
condition. This formula will be numbered vi. 

Table 4 gives the results of combining two 
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arithmetic means and two harmonic means in 
such a way as to have offsetting weight correla- 
tion bias for the three pivotal cases. In the first 
column of that table there appear eight formulas 
common to the three cases. Of these eight 
formulas six are combinations taken two at a 
time of the four formulas (iii, iv, v and v) in 
column 1 of Table 3. These six combinations, 
of course, satisfy the base reversal test since 
they are made up of pairs of means which satisfy 
that test. ‘ 

Formulas x and xi, however, do not satisfy 
the base reversal test and should be eliminated 
from our list of averages which are (a) free from 
weight correlation bias, (b) satisfy the base re- 
versal test and, therefore, are logically superior 
to other averages for measuring the average 
change of prices between two periods. Conse- 
quently, only nine formulas, i to ix, have been 
found which satisfy the two conditions (a) and 
(6) and of these, four, vi to ix, are combinations 
of pairs of formulas i to v. 

To recapitulate, the formulas we have found 
that satisfy tests (a) and (0) are: 
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> 2) wel 
Zin (4 Pad 


OME ery mes 
2() An ae 


(iv) Pips 3(h) es 
2( mi ) Pige ae 
p 


2 


@ (zee, 30) 
Lpiq2 Zig 


(vi) Geometric mean of i and ii 

(vii) Geometric mean of iii and iv 
(viii) Geometric mean of iii and v 
(ix) Geometric mean of iv and v 








It will be interesting to examine the actual 
results obtained by computing index numbers of 
prices, according to various formulas for aver- 
ages, for a special illustrative case. We have 
selected for this purpose the case used by Irving 
Fisher in the Making of Index Numbers! in which 
annual prices and quantities for 36 commodities 
for 1913-18 are utilized. Table 5 presents the 
price index numbers computed by using the 
weighted geometric, harmonic and arithmetic 
averages. Certain of these averages (Ia or IIIh, 
IIa or IV) may be expected, in theory, to give 


*Pp. 498-504. 
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fairly accurate results because the base bias of 
the formulas is opposite to the weight correla- 
tion bias. In all the other index numbers of 
Table 5 base bias does not tend to be neutralized 
or offset by weight correlation bias.1 

Table 6 presents a comparison of the nine 
index numbers satisfying the (a) weight correla- 
tion and (0) base reversal tests, for the same 
illustrative case of 36 commodities. These index 
numbers give remarkably consistent results. 
The range of variation of the fixed base index 
numbers for the nine formulas 1914-18 is as 
follows: 


I9QI4. I915 916 1917 1918 

100,00: 00.87 513:63 ITO1.37 176.63 
Range to to to to to 

100.12 100.06 114.59 163.10 177.80 


The greatest range is in 1917 when it amounts to 
slightly over one per cent. The two index num- 
bers, x and xi, which do not satisfy the base 
reversal test have values within the range given 
for the nine index numbers which do satisfy that 
test. 

Two generalizations may, apparently, be 
drawn from the results obtained from the special 
illustrative case of 36 commodities. First, index 
numbers which satisfy both the weight correla- 


1Jn the illustrative case of 36 commodities here consid- 
ered, the correlation between relatives and weights is prin- 
cipally through prices. 
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tion and base reversal tests give highly consist- 
ent results. Second, satisfaction of the weight 
correlation test is practically much more essen- 
tial for such consistency, than satisfaction of the 
base reversal test — since formulas x and xi 
give results highly similar to those of formulas 
satisfying the base reversal test. 

The formulas from Tables 2, 3, 4 and the 
additional formula coming from combination of 
four geometric means with different sets of 
weights gave us nine formulas for price indexes 
in a binary comparison, each of the price indexes 
being without (¢) weight correlation bias and 
(b) base bias. 

One and only one of the nine formulas for 
price indexes (v) satisfies a third test, that is to 
say, the factor reversal test, which may properly 
be set up for the selection, from the nine, of the 
logically best index. The factor reversal test 
states that the 

(Index of Prices) (Correlative Index of Quan- 
tities) = Ratio of Values 


or P21:'Q21=Va21 


where the index of quantities (Q21) is obtained 
from the index of prices (P21) by interchanging 
the ~’s and q’s and the index of values is the 
ratio Lpoge/=Upiqi. Other of the nine indexes 
than v may be shown, in actual practice, to very 
nearly fulfill the factor reversal test, but such 
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formulas, as we have said, do not, in general, 
fulfill it perfectly. 

We conclude that formula v for prices is logi- 
cally the best answer that we can give to the 
question, “‘ What is the average change of prices 
between two periods or places?”’ Moreover, the 
correlative or companion formula for quantities 
(obtained by substituting g for p and p for q in 
formula v) is logically the best answer that we 
can give to the question, ‘‘What is the average 
change of quantities between two periods or 
places?” 

We may point out in this connection that if 
we take amy average of prices (P2;) it is possible 
to find a formula in terms of the ’s and q’s 
which will satisfy both the base reversal test 
and the factor reversal test. This formula’ is 


1See Fisher’s the Making of Index Numbers, p. 390. 


Note on Fisher’s Method of Building Up Formulas to Satisfy 
His Factor Reversal Text (see The Making of Index Numbers, 
p- 396): 

Let P21 be any price index for period 2 relative to period 1, 
and V2, be the ratio of the aggregate values of period 2 to 
period 1. Then the “factor antithesis” of Pai is V21/Qe1, 
where Q2; is obtained by interchanging the p’s and q’s in Po. 

If we now “‘rectify”” P21 by taking the geometric means 
of P23; and its “factor antithesis” we get 

P» 1° Ve 
Qa1 
which, by its method of construction, will satisfy the factor 
reversal test. We now have a new formula which we propose 
to use as an index of prices. This new formula is the geometric 


mean of 
P:, and V; 1/Qa1 
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P, oOo 1 eis)" 
21°Qi2°* (2piqn)? 


Although this formula satisfies the two tests it 
is, in general, neither an average of prices nor 
an average of quantities. The formula is not, 
therefore, of use in obtaining an answer to the 
question, “What is the average change of prices 
or quantities between two periods or places?” 

In connection with the ‘‘ideal” index (v) we 
may point out that it is simultaneously: 


the square root of the product (Ia-IIa); 
the square root of the product (IIIA-IIa); 
the square root of the product (Ia-IVh); 


The first factor, P:1, is the original index of prices with which 
we started. The second factor V21/Q2i is also supposed to 
be an index of prices. It is obtained by dividing the index 
of values V2, by the index of quantities Qzi, correlative to 
our original index of prices. 

May we properly call the factor V21/Qs1 an index of 
prices? We agree with Fisher (p. 3) that we cannot, for, as 
he says, ‘fan index number of the prices of a number of com- 
modities is an average of their price relatives.’”’ In general, 
the factor cannot be expressed as a function of price relatives 
and cannot be called an average of prices. 

If the first factor is a proper index of prices and the second 
factor is not, the mean of the two factors (P21. V21/Qa1)? 
cannot, in general, be called an index of prices. Fisher, never- 
theless, makes the finding and testing such formulas the 
central feature of his book. Further, when he compares the 
results obtained by using such formulas for the special case 
of 36 commodities 1913-18 he erroneously calls the formulas 
“index numbers” (Table 28, pp. 246-7), whereas many of the 
formulas are not proper averages and, therefore, not index 
numbers. 
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the square root of the product (III/-IVA); 
the fourth root of the product (Ia-IIa-IIIh-IVA). 


We have stated that only one index (v) among 
the nine exactly, and in general, satisfies the 
factor reversal test, but that the other indexes 
very nearly satisfy it. In connection with the 
latter part of the statement, 7.e., “that the other 
indexes nearly satisfy the factor reversal test,” 
let us consider the price index constructed from 
geometric means of relatives (say, formula vi). 


W 
Consider the index, n(®) , where 
1 


W =14 (pign/ Epigit Prge/ Dpigat poi / Zp 
+ 2q2/Zp2q2) and the correlative index of 


quantities, IT n(#).. The product of these two 
q 


1 


indexes is u( Pe ny" . The numerator is the 
141 
weighted product of the values fog. and the 
denominator is the weighted product of the 
values pig. 
The question which we are considering is the 
divergence between the following ratios: 


the weighted product of pogs 
the weighted product of pig 





and the aggregate of p2ga_ 
the aggregate of pig 
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We cannot tell in general what the divergence 
between the two ratios will be, but we can 
ascertain the divergence for a special case. Let 
us consider the special case of 36 commodities 
for 1913-18, used by Fisher, and compare the 


TABLE 7.— VALUE RATIOS COMPARED WITH 
Propuct RATIOS FOR 36 COMMODITIES, 1913-18 





Product Ratios 


Year Value Ratio 


Formula (vi) | Formula (i) Formula (ii) 
1913 100.00 100,00 100.00 100.00 
1914 99-45 99-43 99-42 99-45 
IQI5 108.98 109.03 109.07 109.00 


1916 | 135-75 135.56 135.37 135.74 
IQI7 192.23 192.53 192.60 192.46 
1918 222.71 222.47 222.26 222.68 








ratios of aggregate values with the ratios of 
weighted products (see Table 7). 

It will be seen from the second and third columns 
of the table that in this special case the products 
of the indexes of prices with correlative indexes 
of quantities, using index formula vi, for the 
years 1914-18 is never more than 16/100 of one 
per cent from the value ratio obtained directly. 
The maximum deviation of the products of the 
price and quantity indexes according to formulas 
i and ii, respectively, are 28/100 and 12/100 of 
I per cent. Formulas i, ii and vi, in this special 
case, therefore, all come within 28/100 of one 
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per cent of satisfying the factor reversal test for 
prices and quantities. 

The percentage deviations just given are less 
than the deviations of the price indexes, com- 
puted from these formulas and the price index 
computed from formula v, which perfectly sat- 
isfies the factor reversal test. The maximum 
deviation of price index vi from price index v 
for the same year is 2 parts in 1142 or less than 
two tenths of one per cent. The deviations of 
indexes of prices as computed by formulas i, ii 
and vi from the indexes computed by formula v 
(the ideal) for the special case are as follows: 








Actual Deviations from Price Index v 











Price 
Index 

IOQI4 IQIS 1916 IQI7 1918 
(i) ro) +.05 —.38 +.49 +.15 
(ii) ° —.02 —.02 —.19 —.31 
(vi) ° Or —.20 +.15 —.08 

Price Index v 

(v) t00.12 | 99.89 | 114.21 | 161.56 | 177.65 




















We may justly characterize the price index v 
as logically superior to the other indexes that we 
have examined (and in fact superior to any other 
known index) for a binary comparison.1 The 

1 The “ideal” index (v) is logically superior to other indexes 
for a binary comparison only. For a comparison of three or 


more periods each with the other, we shall show that it is 
impossible for an average to satisfy both the factor reversal 
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logical superiority of v to the other indexes of 
our group of nine results from the fact that v 
alone satisfies the factor reversal test. But the 
other indexes very nearly satisfy that test. The 
factor reversal test, therefore, merely gives us a 
basis for selecting one of a number of indexes, 
all of which give highly similar results (as shown 
by an illustrative case) in a binary comparison. 
That is to say, the factor reversal test enables 
us to select the logically best index from a num- 
ber of logically good indexes which, when tested 
in actual cases, give highly similar results. In 
still other words, the factor reversal test selects 
one index from a number of indexes in the same 
homogeneous group, both logically and actually; 
it does not select a good index from a hetero- 
geneous group of indexes. Finally, all of the 
tests refer to a binary comparison only. 


Effect of weights on various items of any series. 
Suppose that we consider a case of prices and 
associated quantities of the most general sort, 
z.e., in which some of the price and quantity 
relatives are positively correlated, some are 


test and the circular test (the latter being the generalization 
for three or more periods of the base reversal test for two 
periods). In such a case — of partially conflicting criteria — 
where it is impossible for us to decide whether one criterion 
is more important than another, the formula which comes 
closest to satisfying all of the criteria, although it satisfies none 
exactly, appears to hold as strong or a stronger logical] position 
than another formula which satisfies some of the criteria 
exactly and the others with a considerable margin of error. 
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negatively correlated, and some are correlated 
neither positively nor negatively. Assume that 
the formula (v) found above for prices and the 
companion formula for quantities are utilized in 
computing index numbers for this general case. 
We will call these index numbers P and Q. The 
items of the series (the general case) may be 
divided into five subgroups, as follows: 


For subgroup (1) p2/fi>P and q2:/™m20 
For subgroup (2) f2/fi>P and q2/qa<Q 
For subgroup (3) f2/pi1<P and q2/m20 
For subgroup (4) p2/fi<P and q@/qai<Q 


For subgroup (5) f:/fi=P and ge/n=0 


For subgroup (1) of any series pog2/Dpoq2> 
pigi/=piq. and for subgroup (4) poge2/Zpaq2< 
pigi/=pigi. Also, there may be a section (a) of 
the subgroup (2) for which the first of the above 
relationships between the weights holds true and 
a section (b) of the subgroup (3) for which the 
second relationship holds true. We conclude 
that: 

For all items in subgroups (1), (2a), (36) and 
(4) the use of given year weights will tend to 
result in a larger price index than that obtained 
from base year weights (and also larger than P, 
which presumably has no weight correlation 
bias). 

For all items in subgroup (5) variation in 
weights will not tend to change the price index P. 
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For all items in subgroups (25) and (3a) the 
use of given year weights will tend to result in a 
smaller price index than that obtained from base 
year weights (and also smaller than P, which 
presumably has no weight correlation bias). 

For the whole series, the use of given year 
weights will actually result in a larger index than 
that obtained from the use of base year weights 
unless, first, there are one or more relatives in 
subgroup (20) with negatively correlated weights, 
or/and in subgroup (3a) with positively corre- 
lated weights and, second, the downward effect 
of the weight bias for these relatives is more than 
sufficient in magnitude to offset the upward 
effect of the weight bias of the remainder of the 
series. 


For subgroup (2) of any series we may easily 
prove that poqi/Zpoqi> pig2/Zpig2 and for sub- 
group (3) that pogi/Dpog1< pig2/Zpige. Also, there 
may be a section (a) of the subgroup (1) for 
which the first part of the above relationship 
between the weights holds true and a section 
(6) of the subgroup (4) for which the second 
part of the relationship holds true. We conclude 
that: 

For all the items in subgroups (1a), (2), (3) 
and (4b) the use of given year prices and base 
year quantities in weights will tend to result in 
a larger price index than that obtained from the 
use of base year prices and given year quantities 
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in weights (and also larger than P, which pre- 
sumably has no weight correlation bias). 

For all items in subgroup (5) variation in 
weights will not tend to change the price index P. 

For all items in subgroups (10) and (4a) the 
use of given year prices and base year quantities 
in weights will tend to result in a smaller price 
index than that obtained from the use of base 
year prices and given year quantities in weights 
(and also smaller than P which presumably has 
no weight bias). 

For the whole series, the use of given year 
prices and base year quantities will actually re- 
sult in a larger index than that obtained from 
the use of base year prices and given year quan- 
tities in weights unless, first, there are one or 
more relatives in subgroup (1b) with negatively 
correlated weights or/and in subgroup (4a) with 
positively correlated weights and, second, the 
downward effect of the weight bias in these 
relatives is more than sufficient in magnitude to 
offset the upward effect of the weight bias of the 
remainder of the series. 


II. THe PRoBLEM oF MEASURING THE AVERAGE 

CHANGE OF PRICES BETWEEN THREE OR MORE 
PERIODS 

If P21, Ps, and P32 are the best index numbers 

for comparing price changes between pairs of 

three periods (1, 2, 3) does it follow that they 

are the best to use for a simultaneous comparison 
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of each year with both other years, such a com- 
parison as we make when we plot three or more 
index numbers on the same chart? 

First, it appears that if P21 represents the 
average of prices in period 2 relative to period 1 
and P3; represents the average of prices in 
period 3 relative to period 1 then P3,/P21 should 
equal P32. But this relationship will not hold in 
general for any index in which the weights of the 
relatives in the three values P3;, Poi and P32 
differ from each other., In other words, in order 
that the relationship P3:/P21=P3:2 hold true in 
general we must have fixed weights. Thus, for 
the ideal index (v) to satisfy the circular text we 
should have 


i i = Sar es Z pq, Zp sg2 oes) 2 
ZAigi LPigs ZPigi Vpigz Upeg2 Ups 


and Ziq. _ Zh, 2PrIs 


Lpsq2 Zpq3 Upige 
It is evident, therefore, that formula v does not 
reduce to an identity and will not satisfy the 
circular test unless there is a certain special 
relationship between the p’s and q’s, 7.e., the rela- 
tionship indicated by the preceding equation. 
If we assume (r) any three sets of prices (f1,/2 
and #3) in the three periods and (2) correspond- 
ing quantities for all commodities except one 
for one of the periods we can, by solving the 
equation of condition given above, find the value 
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that the single remaining g must have to satisfy 
the equation and, therefore, the condition that 
the circular test be satisfied in the special case. 
There are, clearly, any number of special cases 
for which the circular test will be satisfied. We 
are not, however, concerned with the question 
as to what may happen in these special cases 
but rather with the question, ‘‘Is it possible, 
assuming the ’s to take any sets of values 
assigned, to have the equation of condition re- 
duce to an identity?” The answer is obvious: 
only if the quantities in the three years are the 
same (¢:1=q:=qs). That is to say, the aggrega- 
tive formula for a price index will satisfy the 
circular test (assuming the p’s to move as they 
will) only if the corresponding quantities are 
constant. 

Likewise, the other formulas of our list of nine 
will not satisfy the circular test (i.e., that the 
quotient of amy pair of index numbers to any 
base, P,;/P;;, equals the index number obtained 
for one of the pair of periods to the other of the 
pair as the base, P,,) unless the weights are 
fixed.! If we assume fixed weights all of our 

1The condition that the geometric formulas i, ii and vi 
should satisfy the circular test for any three periods 1, 2 and 

1S 
: Psi=Pii* Ps. 0F 


n(ayraa rem 


(where the W’s stand for different sets of value weights) 
or psVa —Wa= psa —Warpy, Wai—War 
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nine formulas reduce to two, (1) the geometric 
mean of relatives with fixed weights and (2) the 
aggregative mean with fixed quantities. These 
two and only these two forms of our averages, 
therefore, (1) are without weight correlation 
bias, (2) satisfy the base reversal test, and (3) 
satisfy the circular test. 

If we use any fixed weights in the geometric 
average and amy fixed quantities in the aggre- 
gative average the third condition named above 
is fulfilled. But we will not get good indexes of 
prices from these formulas unless, in the geo- 
metric average, the weights are based on prices 
and quantities which may justly be retained for 
all the periods in the interval for which index 
numbers are to be computed, and wmnless, in the 
ageregative average, the quantities may justly 
be retained for all the periods in the interval for 
which index numbers are to be computed. 

For a binary comparison, the obvious fixed 
weights to use in the geometric mean are 
V4 (pig/Zphigt poqg/Zpq), which are the weights 
of formula i with fixed q’s where g=the mean of 
gq and q, say g=}4(m+q). Likewise, for an 
ageregative mean, the obvious fixed quantities 
to use are 14(gitq2). For a large number of 
This will reduce to an identity for all values of pi, po and ps 
only if 

Wsi=Wri=Wss 
That is to say, the geometric means will satisfy the circular 


test, in general, only if the weights are fixed and constant for 
the three periods. 
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periods () the obvious quantities to use are 
1/n(qitq+***Gn). The arithmetic mean of 
quantities gives a satisfactorily simple method 
of getting fixed weights for the aggregative 
formula, but in the geometric formula, the use 
of the fixed quantities with the price of each 
and every period is an unnecessarily laborious 
method of securing weights. We may substitute, 
therefore, in practice value weights obtained by 
combining (1) an average of prices for part or 
whole of the interval in question with (2) a 
corresponding average of quantities. 


It will be observed from the preceding argu- 
ment that the index numbers thus obtained (sat- 
isfying the circular test) will give useful results 
only for homogeneous intervals. This observa- 
tion leads to the generalization that accurate 
comparisons are not possible over long intervals 
in which the q’s have changed radically. Further, 
even though we limit ourselves to a binary com- 
parison of periods in which commodities and 
quantities are vastly different (say 1776 and 
1928) the ‘‘ideal”’ formula would be of little aid 
in constructing an index of general prices. The 
difficulty is not in the lack of an average to use 
but in the data to be averaged. 


That the circular test is the logical extension 
of the base reversal test follows from the fact 
that a necessary condition for the satisfaction 
of the circular test, constant weights, is also a 
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necessary condition for the satisfaction of the 
base reversal test. 

Necessary condition for satisfaction of the base 
reversal test. It will be noticed that formula v, 
the ‘‘ideal” form, is really a function of 2” rela- 


tives (2 and pr for nm commodities ; Formula 


1 pr 
v, with respect to period 1 as base, is 
pr Vas 
1 rr 191 
(Obst . Lpidt f 
Dpo2 


This may be described as the square root of the 
product of 
Reciprocal of the arith- Arithmetic mean of 
metic mean of relatives ! the relatives p2/p1 
p/p. weighted by the weighted by the ratios 
ratios pog2/Dp.qa pig: /ZPin 
Formula v, with respect to period 2 as base, is 


M4 
Cre 3 
Lpoq2 2(O pa 
Lpigi 
This may be described as the square root of the 
product of 
Arithmetic mean of Reciprocal of the arith- 
the relatives p:/p2 metic mean of the rela- 
weighted by the ratios _) tive p:/p. weighted by 


P2Q2/ Zp2Q2 the ratios pigi/Zpiqi 
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The two expressions for the formula v obtained, 
respectively, by using each of two periods as the 
base both contain the same relatives with the 
same weights applied to those relatives. In 
other words, if we select the relative for any 
commodity and ascertain the weight of that 
relative in formula v, computed forward (with 
one of the two periods as the base), and then 
ascertain the weight of the same relative in 
formula v, computed backward (with the other 
of the two periods as the base), we shall find that 
the weights are identical. In still other words, 
the weight of the relatives with respect to each 
other and the weights of the reciprocals of the 
relatives with respect to each other are the same 
whether we compute the index forward or back- 
ward. The base reversal test is satisfied by for- 
mula v not because of changing emphasis (for the 
emphasis does not change) on the individual 
price relatives, but because the relatives that 
appear in the numerator of the index computed 
forward occur in the denominator of the index 
figured backward. The same generalization is 
true of formulas ili, iv and vii to ix inclusive. 

It will also be noticed that in the geometric 
means of relatives, formulas i, ii and vi, the 
weights of the relatives computed forward are 
the same as the weights of the relatives com- 
puted backward, but the relatives appear in the 
numerator in one case and in the denominator 
in the other. Consequently, geometric means of 
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relatives which satisfy the base reversal test and 
have no weight correlation bias, in fact have 
constant weights figured forward and backward. 

In all of our nine formulas, therefore, the 
weights of the individual relatives for the indexes 
computed forward and backward are fixed. That 
is, even in a binary comparison the only averages 
of relatives that we have secured which (a) have 
no weight correlation bias and (0) satisfy the 
base reversal test are averages in which every 
relative has the same weight whether the first 
or the second period be taken as the base. To 
express the same idea in other words, in a binary 
comparison the weights of relatives actually com- 
puted for formulas i to ix inclusive with one 
period as the base are applied to precisely the 
same relatives when the other period is used as 
the base. 

It appears further that, if any average is to 
satisfy the base reversal test a necessary condi- 
tion is that the weight applied to each relative 
when one of the two periods is taken as the base 
must be the same as the weight applied to the 
same relative when the other of the two periods 
is taken as the base. 

The obvious generalization of the base reversal 
test, which involves two periods only, is the cir- 
cular test, which involves three or more periods. 
A necessary condition for satisfaction of the base 
reversal test is fixed weights when the index is 
computed forward and backward. Likewise a 
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necessary condition for the satisfaction of the 
circular test is the use of the same fixed weights 
for all periods. 


III. CoMpARISON OF FIXED WEIGHT INDEXES 
WITH VARIABLE WEIGHT INDEXES FOR THREE 
OR More PERIODS. 


Professor Irving Fisher selected the formula 
which is numbered v in our system as ‘‘the best”’ 
or ‘‘the ideal” index number for an intercom- 
parison of three or more periods as well as for a 
binary comparison. He selected the ‘‘ideal” 
index number from 13 formulas (these 13 are 
not all true index numbers, however) satisfying 
the base reversal and factor reversal tests! be- 
cause the ‘‘ideal’” was not only judged to be 
simpler in form than the others, but also because 
he had awarded it the first place in accuracy.? 
Taking the “‘ideal’’ formula as his standard of 
accuracy, Professor Fisher then ranked 119 
formulas, on the basis of a comparison of the 
actual index numbers obtained for the years 
1914-18 relative to 1913 for the special case of 
36 series of commodity prices and their asso- 
ciated quantities. Of the 119 formulas, the 13 
ranking the highest — that is, most closely ap- 
proximating the values of the “ideal” formula 
— according to Fisher, are the following: 

1 The Making of Index Numbers, pp. 220-21. 


2 Tbid., Table 28, pp. 244-47. 
® The Making of Index Numbers, Appendix VI, pp. 489-99. 
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Ranking 

I, 2,3 (x) 
4, 6, § (4) 
9, 7) 8 (9) 


13, 12,20 (13) 


Nature of Formulas 


Geometric mean of 2poqi/Dpign 
and Ypog2/=Zpig2, (2) arithmetic 
mean of same pair of aggrega- 
tives, and (3) factor antithesis 
of the last named formula, re- 
spectively, 

Arithmetically crossed weight 
aggregative 

[2 po(qitge)/Zpilqit+gs)], 

(6) factor antithesis of the pre- 
ceding formula, and (5) the 
geometric mean of the two 
formulas last named, respec- 
tively. 

Geometrically crossed weight 
aggregative 

[DpeV quge/EpPrV 192], 

(7) factor antithesis of the pre- 
ceding formula, and (8) the 
geometric mean of the two 
formulas last named, respec- 
tively. 

Geometric average of relatives 
with geometric crossed weights, 
(12) factor antithesis of the pre- 
ceding formula, and (10) the 
geometric mean of the two pre- 
ceding formulas last named, re- 
spectively. 
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Ranking Nature of Formulas 


II (rr) Geometric mean of 8 geometric 
averages of relatives obtained 
by using each of the 4 sets of 
value weights and combining 
the index numbers, thus se- 
cured, with their factor antith- 
eses. 


The 13 formulas listed above, which Fisher 
ranks highest with respect to his chosen standard, 
are not the same 13 which he had previously 
selected as satisfying the base reversal and factor 
reversal tests. In fact, the formulas ranking 2 
and 3 satisfy neither test, and only those rank- 
ing 5, 8, 1o and 11 satisfy the factor reversal 
test. In fact, the 4 formulas last named were 
adjusted or ‘‘rectified”” by Professor Fisher in 
order to make them satisfy the factor reversal 
test. These formulas are not, however, according 
to my notion, or according to Professor Fisher’s 
definitions, true index numbers. Of the 13 rank- 
ing formulas listed above only those numbered 
1, 4, 9 and 13 are true index numbers. 

The index number used as the standard of 
comparison — and, therefore, automatically 
ranked first — is the ‘‘ideal.” 

The index number (not formula) ranked next 
is the arithmetic average of relatives with value 
weights obtained by using base year prices with 
the arithmetic means of quantities for the two 
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years being compared. The index number ranked 
third is also-an arithmetic average of relatives 
with value weights obtained by using base year 
prices, but these prices are combined with the 
geometric means (instead of arithmetic means) 
of quantities for the two years being compared. 

The index number ranked fourth is the geo- 
metric average of relatives with value weights 
obtained by taking the geometric mean of the 
values in the two years being compared. 

It is possible to generalize the index number 
ranked next to the standard (in the binary com- 
parison) for an intercomparison of three or more 
periods. The constant value weights indicated 
in such a generalization would be obtained by 
using base year prices with the arithmetic mean 
of quantities for all the years being compared. 

It is possible also to generalize, for a compari- 
son of three or more periods, the index number 
ranked third to the standard. The constant 
weights that would be indicated in such a gen- 
eralization would be obtained by using base year 
prices with the geometric means of quantities for 
all the years being compared. The only dif- 
ference, it will be noticed, between the general- 
ization thus indicated and that indicated in the 
previous paragraph is the difference between 
arithmetic and geometric means of the same set 
of quantities in the weights. Such a difference 
in weights would clearly result in a difference in 
the resulting index numbers of the nature of a 
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higher order infinitesimal compared with the dif- 
ferences resulting from different methods of 
averaging and weighting the relatives themselves. 
Therefore, only one of the generalizations need 
be retained for actual use in computing index 
numbers. The generalization to be retained 
should be, naturally, the simpler of the two, that 
is, the one utilizing in obtaining weights the 
arithmetic, rather than the geometric, average 
of quantities. 

It is possible to generalize, also, the index 
numbers ranked fourth to the standard for an 
intercomparison of three or more periods. The 
constant value weights indicated in such a gen- 
eralization would be obtained by using the geo- 
metric mean of values for all the year being com- 
pared. It is evident, however, that the simpler 
arithmetic mean of values could be substituted 
in actual practice, for the geometric mean. Such 
a substitution, in actual practice, would result 
in a negligible difference. 

We have, therefore, secured two index num- 
bers generalized for an intercomparison of three 
or more periods, one an arithmetic average of 
relatives with an appropriate set of constant 
value weights and the other a geometric average 
of relatives with another appropriate set of con- 
stant value weights. These index numbers are 
the same as those derived on page 92 of Section 
II of the present article. The algebraic expres- 


COMPARISON OF INDEXES 49 


sion for these index numbers for the period i 
relative to the period 1, respectively, 


G+ +e. “+n 


= pig/=piq where g= : 
and n( 2)" were pey tha 
1 
andy Csseat Ps. 
un 


We may note in passing that these two index 
numbers computed for each of the five binary 
comparisons of prices for the years 1914—18 rela- 
tive to 1913, using Professor Fisher’s data for 
36 commodities, give results remarkably close to 
those given by the ‘‘ideal’”’ formula. The actual 
index numbers for prices thus computed are as 
follows: 


TABLE 8.— BINARY COMPARISONS OF THREE INDEX 
NuMBERS OF Prices, BASE: 1913* 





Weighted averages | 1914 | 1915 | 1916 | rgI17 | 1918 


ideally. tos factors cen 100.12] 99.89] 114.21] 161.56) 177.65 
Arithmeticf...... 100.12] 99.89] 114.23] 161.52| 177.63 
Geometric........ 100.1 | 99.9 | 113.8 | 162.1 | 177.8 








* Data from The Making of Index Numbers, pp. 247. 
{ Aggregative mean. The weights used are indicated by the 
formula on p. 39. 
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We may also note in passing, first, that in a 
binary comparison the arithmetic average of 
relatives with weights obtained by using base 


year prices and NT 9 will always lie between the 
2 


two constituents of the “ideal”? index and, 
second, that Professor Fisher has been unable 
to find any actual case in which the two index 
numbers differ by more than three-fourths of 
one per cent and, he remarks, “this, in most 
cases, would not be considered worth taking into 
account.’ 

Table 9 presents a comparison, for Fisher’s 
special case of 36 commodities, of two index 
numbers of prices with fixed weights — the 
weighted geometric and arithmetic averages of 
relatives — with the ‘‘ideal’” formula.? The 
maximum percentage deviation of the arithmetic 
mean with fixed weights from the ‘‘ideal”’ 
formula with variable weights is 0.33 of one per 
cent in 1914, and that of the geometric mean 
with fixed weights is 0.67 of one per cent in 1916. 

Table 10 presents a similar comparison of 

1 The Making of Index Numbers, Appendix I, p. 430. 

2 Tt is noteworthy that among the scores of index numbers 
which Fisher computed for his 36 commodities, he includes 
only one fixed weight formula, No. 9051, p. 518. This is a 
“round weight” formula obtained “‘after judicious shifts of 
decimal points of the 36 quotations.” Even this round weight 
formula he classified as a “fair index number.”’ If weights are 
more carefully chosen, fixed weight index numbers would have 


to be classified, using Fisher’s terminology and standard of 
excellence, at least as ‘‘ very good.” 
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index numbers of quantities. The maximum 
percentage differences between the arithmetic 
and geometric means with fixed weights on the 
one hand, and the ‘‘ideal’”’ formula on the other, 
are, respectively, 0.38 of one per cent in 1914 
and 0.56 of one per cent in 1916. 

Tables 11 and 12 present comparisons of two 
index numbers, arithmetic and geometric means, 
with fixed weights for Fisher’s special case of 36 
commodities, the index numbers, however, be- 
ing computed for three sub-groups as well as 
for all commodities. The maximum absolute 
divergence of corresponding indexes for quanti- 
ties is in 1916 and for the subgroup of “‘15 com- 
modities with weights of less than one per cent 
each.” In this instance, the weighted geometric 
mean is 121.23 as compared with 116.64 for the 
weighted arithmetic mean, a divergence of about 
4 per cent. In 1914 and rors the absolute diver- 
gence was nearly as great as in 1916. 

Table 13 presents comparisons of the value 
ratios for the 36 commodities and each subgroup, 
on the one hand, with the products of correspond- 
ing indexes of prices and quantities, on the other 
hand. This comparison shows the extent of the 
deviation of the products of corresponding fixed 
weight indexes for prices and quantities from 
perfect satisfaction of the factor reversal test. 
For the 36 commodities the maximum percentage 
deviation of the product of corresponding arith- 
metic indexes for prices and quantities is 0.71 
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of one per cent in 1914 — in all other years the 
deviation is less than one-half of one per cent. 
For the same group of commodities the maxi- 
mum percentage deviation of the products of 
correlative geometric indexes for prices and 
quantities is 1.22 per cent in 1916. Among the 
subgroups there is an especially marked diver- 
gence in the subgroup of 15 commodities. In 
this subgroup the divergence is most pronounced 
in 1916. 

The divergence of the geometric and arith- 
metic indexes for the subgroups for 15 com- 
modities is so substantial and so much greater 
than in other subgroups, as shown by Tables 11, 
12, and 13 that a special investigation of the 
source of these discrepancies was undertaken. 
Since the discrepancies appeared more emphati- 
cally in the indexes for quantities than for prices, 
attention was directed to the quantity relatives. 
The data for the indexes of quantities for the 
subgroup of 15 commodities for the year 1916 
are given in Table 14. 

Table 14 reveals a remarkable situation among 
the quantity relatives and their weights in the 
subgroup for 15 commodities. It happens that 
for every one of the larger relatives, 7.e., over 
121.23, the weights are heavier for the geometric 
mean than for the arithmetic mean and for all 
of the small relatives, z.e., under 116.64, except- 
ing that for anthracite coal, only, the weights 
are lighter for the geometric mean than for the 
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arithmetic mean. The large discrepancy between 
the two means, therefore, arises from the acci- 
dental positive correlation between the weights 
and relatives of the geometric mean as compared 
with that between the weights and relatives of 
the arithmetic mean. If we use the geometric 
weights in the arithmetic mean of relatives we 
get an index number of 117 (instead of 116.64) 
and if we use the arithmetic weights in the geo- 
metric mean of relatives we get an index number 
of 106 (instead of 121.23). It is obviously the 
weights rather than “the method of averaging 
that produces the discrepancy in this instance. 


IV. Spectat DirFicuLTIeS (OTHER THAN THE 
PROBLEM OF AVERAGING GIVEN HOMOGE- 
NEOUS DATA) OF OBTAINING SATISFACTORY 
Price INDEXES FOR Two or More PERIODS 

OR PLACES 


In our discussion up to this point we assumed 
that we had at our disposal data for prices and 
corresponding quantities for a number (”) of 
commodities or articles in one period (1) and 
for precisely the same commodities or articles 
for another period (2). In assuming that there 
were 2 commodities in both periods we implicitly 
assumed also that we had m finite price relatives, 
i.e., that none of the p’swere zero or infinite, and 
that we had n finite quantities, z.c., that none 
of the q’s were zero or infinite. That is to say, 
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we excluded free goods and goods not common 
to both periods. Upon the basis of these assump- 
tions we found satisfactory averages of prices to 
use either for a binary comparison or for a com- 
parison of two or more periods (the aggregative 
formula with fixed quantities obtained by taking 
the aggregates of the quantities for all the periods 
being compared). 

In any actual problem of finding the average 
change of prices between any two years we must, 
at the outset, assemble the data. The difficulties 
encountered in this process are: 

(1) Definition of the field. Is it our object to 
measure the average change in prices of material 
commodities only, such as wheat, hides, potatoes, 
automobiles, vacuum sweepers, chairs and houses, 
or of material commodities and services, such as 
electrical power for light and refrigeration, taxi 
rides, haircuts, voyages to Europe, seats at the 
theatre, rooms at hotels, and wages of servants, 
or of material commodities, services and invest- 
ments, such as stocks, bonds, houses and land? 
Do we want to limit the field to certain varieties 
of economic goods, such as building materials or 
food, or certain classes of prices, wholesale or 
retail, or certain quantities of commodities, such 
as to exclude commodities with nominal quan- 
tities in either period? Do we want to include 
raw materials, partly manufactured goods and 
finished goods? Shall we include by-products, 
patented articles, and monopolistic goods? 
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Shall we include or exclude the long list of 
articles, like fabrics, shoes and mah-jong sets in 
which the style element is a leading factor in 
price? Do we want to have reflected in the 
“average change of price” (which we are 
attempting to get) changes in supply and 
demand connected with the individual com- 
modities, regardless of the nature of the changes, 
and changes on the money side, regardless of 
their nature? In general the definition of the 
field is determined by the purpose of the index 
number to be constructed. 

(2) It is difficult to get p’s and q’s that exactly 
correspond. ‘There may be, for instance, five or 
ten different grades of a commodity, such as 
wheat. If we take only one or two series of 
price quotations we must face the question: 
‘Should the corresponding q be the quantity of 
that grade of wheat or of all wheat sold?” The 
answer to this question depends upon the pur- 
pose of our index. Often we are obliged to use 
quantities that correspond exactly neither to our 
individual price series nor the total of the com- 
modity represented by our individual price series. 
Further, we may not be able to find q’s to match 
prices for every time unit (week, month, quarter, 
or year) for which we desire price indexes. Even 
though we desire to use variable weights we can 
not get them, usually, for as frequent intervals 
as we can get prices. _ 

(3) Procedure if q or g2 equals zero. Suppose 
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that our object has been completely defined and 
it is to secure a comprehensive index of the cost 
of living in 1928 as compared with that in 1870. 
We find that in 1870 lighting was by kerosene 
lamps and that kerosene was a significant item 
in the budget whereas in 1928 lighting is by 
electricity. Electric power was not sold in 1870 
so that for this element in the budget gi=o. We 
are therefore obliged to omit electric power as 
such. But we might eliminate both kerosene and 
electricity from our list of price series and sub- 
stitute “domestic illumination” in lieu of them. 
If we do this we would have to determine upon 
homogeneous units of domestic illumination in 
the two periods, and ascertain the number and 
prices of those units in 1870 and 1928. If this is 
impossible or inexpedient we must be satisfied 
with an index number which reflects only part 
of the changes in the cost of living. Excluding 
electricity we ought also to exclude kerosene 
from our list of elements in the cost of living. 
(4) Changes in the nature of the commodity with 
time. A similar problem occurs when we are 
comparing, say, the cost of food in July with 
that in January. The commodity, potatoes, for 
instance, means mainly old potatoes in January 
and new potatoes in July. Instead of one series, 
we must have two, and if either series disappears 
entirely in one or the other period (q; or g2=0) 
the commodity must be eliminated from our list. 
Likewise, we may find it necessary to subdivide 
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other seasonal commodities into several homoge- 
neous commodities, such as eggs into the classes 
depending upon the age of the commodity. 
There are also other commodities than foods, 
such as ‘‘packer’s heavy native steer hides,” 
which have a seasonal variation in price because 
the commodity changes with the seasons. Thus, 
steer hides of a fixed grade are cheaper in the 
spring than in the autumn because the hides 
are apt to have blemishes acquired during the 
winter season which are not acquired to the 
same extent in summer. In this case it is logical 
to make some adjustment for the seasonal change 
of price if the commodity is to be retained in our 
index. 

(5) Special difficulties of obtaining satisfactory 
price indexes for a series of periods. The difh- 
culties of obtaining homogeneous price series 
for two periods are, of course, even more apt to 
be encountered if our object is to get a continu- 
ous series of price indexes. Having set up a list 
of satisfactory price series and having secured 
satisfactory weights for one interval we may find 
that another or an amended list of prices and 
weights is more satisfactory for an ensuing in- 
terval. We may be obliged to compute our index 
numbers for separate intervals and splice the in- 
dex numbers in some fashion to get a continuous 
series. Or, in constructing a current extension of 
a series of index numbers we may suddenly find 
that one or more price series are no longer pro- 
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curable. In this case, unless we recompute the 
index from the beginning, we are obliged to 
make some sort of substitution of a new price 
series for the old. 

(6) Chain indexes. In this connection we 
may say that the chain method of computing 
index numbers is adapted to making changes in 
commodity lists. It has been proven, however, 
to be less reliable than the fixed base method as 
a means of obtaining a homogeneous series of 
price indexes. The relation between chain and 
fixed base indexes will be discussed in the follow- 
ing section. 


V. Tue DIVERGENCE OF CHAIN INDEXES 
FROM CORRESPONDING FIxED BASE INDEXES 


If fixed weights are used for all years, any 
average which satisfies the base reversal test 
will also satisfy the circular test. With fixed 
weights, therefore, the chain index for any year, 
computed by an average which satisfies the base 
reversal test, will be identical with the fixed base 
index for the same year computed by the same 
formula. 

If, however, the weights depend upon the data 
of the pairs of years for which the links of the 
chain index are computed, the resulting chain 
index and the corresponding fixed base index 
for the same year, computed by the same for- 
mula, will, in general, differ. 
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It has frequently been observed, in actual 
practice, not only that chain indexes with 
changing weights differ from corresponding 
fixed base indexes, but also that the divergence 
of the two indexes tends to persist and even to 
become irregularly greater and greater as the 
number of links in the chain increases. Thus, in 
the case of Fisher’s 36 commodities the chain 
indexes, computed by our nine formulas as 
shown in Table 6, are greater than the corre- 
sponding fixed base indexes in every instance. 
In a second case, 7.¢., the indexes of the prices 
and quantities of twelve leading crops, 1910-19, 
computed by formula v by the chain method and 
fixed base method, the chain index for prices is 
always less than the corresponding fixed base 
index for prices and the chain index for quanti- 
ties is always greater than the corresponding 
fixed base index for quantities.! 

In spite of the divergence of chain indexes 
from fixed base indexes some students of the 
subject have recommended the use of the former, 


1It was, apparently, on the basis of results actually found 
in special cases that led Fisher to say, “the chain is subject to 
cumulative error and ought not to be used (unless, possibly, 
as supplementary to the fixed base system).’”’ But he incon- 
sistently says on another page, ‘‘the present investigation has 
shown that, in the case of all good index numbers, there is no 
really perceptible difference between the chain and fixed base 
figures.”” See The Making of Index Numbers, p. 319 and p. 309. 

For the indexes of prices and quantities of twelve crops 
see ‘“‘ Fisher’s Formula for Index Numbers,” by Warren M. 
Persons, in the Review of Economic Statistics, May 1921, p. 110. 


66 INDEX NUMBERS 


but no one has made a thoroughgoing investiga- 
tion of the causes of the divergence. 

Some have ascribed the divergence in a single 
year to chance and the cumulative divergence 
over a period of years to a “‘run of luck.”’ Such 
an explanation, however, does not carry convic- 
tion, because prices and quantities in consecu- 
tive years are not “independent events”’ subject 
to the laws of games of chance. 

As a basis for discussing the cause of the di- 
vergence between chain indexes and fixed base 
indexes computed by corresponding formulas, 
we shall find expressions for the rates of diverg- 
ence between them. We shall derive and con- 
sider such expressions for (a) formula v, the 
““ideal,”’ and (5) formula i, the geometric mean. 


(a) Let the ratio of the chain index to the 
fixed base index (computed by formula v in 


1Thus, C. M. Walsh in the Problem of Estimation, 1921, 
PP. 105, 137, says that a Committee of the British Association 
of which Professor F. G. Edgeworth was secretary, ‘rightly 
recommended what has since been called the ‘chain’ system 
of making index numbers for every year compared with its 
predecessor.” 

The discussion of chain indexes by Wesley C. Mitchell in 
the Bulletin of the U. S. Bureau of Labor Statistics, No. 284, 
pp- 81-91, is useless for our purpose because it is limited to a 
comparison of chain and fixed base indexes computed by 
taking the simple arithmetic mean of relatives. The divergence 
between the two indexes thus obtained results mainly from 
the facts that the arithmetic average has an upward base 
bias and that this bias is cumulative in the chain index. 
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both cases) for year # be represented by R,. 
Then 








Re P32P21 n(aem, Zp on 
P31 Zpqs Vpige TUhsqi 

R= (Zhe _ZPaq2 Zr ey 
Zp3q4 Zs Uhig2 Dpaqi 





a(n ZPn-19n-2 , ewes = pogi I) 
ZPn-19n ZPn-29n-1 ZPpiga LPnQs 


Rese ee 2Pdn-t, ,, , UP Bue 
ZPndn+1 UP nln Zpiga LPn4191 


Dividing R,+41 (#+1 factors) by R,(n factors) 
we have: 


Bet eee. ZP1Gn +1 a) 
R, ZPndntt ZPn+191 UPiGn 


which is the expression for the rate of change of 
the chain index as compared with the fixed base 
index between any two consecutive periods. 
This ratio will be greater than, equal to, or less 
than 1. That is to say: 














Ss ; 
Ratt is either = 1, forany given values of pand g. 
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The relationship just given may be expressed 
in terms of three weighted averages of relative 
prices as follows: 





“p, (bid » > F*(bsdn) =" (Pog) 


se a 


ZPidn ZPidn+1 < ZPnQr 
There: are, : course, any number of special 


cases in which =1. In such special cases the 


chain and fixed base indexes will coincide or move 
parallel. In general, however, the chain and fixed 
base indexes will diverge. Not only will the two 
indexes usually diverge, but they will diverge, 
in general, even though prices of all commodities 
in period (w+1) are precisely the same as in 
period (7). 

We may illustrate the sort of thing that may 
occur in actual practice by taking an extreme 
case of the average of the prices of two com- 
modities with fixed prices in all periods except 
the base period, but with quantities which vary 
in pronounced fashion from period to period. 
Consider commodities A and B for 7 periods 
as follows: 











Commodity A: 








Commodity B: 
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The chain and fixed base indexes computed 
by formula vy are as follows: 


Chain index...... 427 | 427 | 427 | 427 | 427 | 427 
Fixed base index. .} 427 | 229 | 427 | 498 | 427 | 427 








It will be noticed that the chain index remains 
fixed at 427 while the fixed base index fluctuates 
widely, between 229 and 4098. This illustration 
shows that by making the quantities vary in a 
certain fashion from year to year it is possible 
to make the fixed base index change more or 
less rapidly (as we wish) than the chain index. 

Consider, in this connection, the indexes for 
the periods 4 and 5 in which the fixed base index 
moves upward faster than the chain index, 2.e., 
the chain index diverges downward. It is obvious 
that the chain index would still tend to diverge 
downward if the prices of commodities A and B 
did not remain absolutely fixed, but instead 
merely maintained their approximate positions 
relative to the fixed base. That is to say, if 
prices do not remain absolutely fixed from period 
to period, but change slightly so that high prices 
in one period are ‘‘about as high” in the suc- 
ceeding period and low prices in one period are 
“about as low” in the succeeding period the 
results of the illustration just considered will 
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tend to be approximated by the chain and fixed 
base indexes. 

The illustration which we have just considered 
suggests that, in actual practice, if important 
commodities rise in price more than the average 
(relative to a fixed base) and remain high in 
price in the succeeding period (relative to the 
average for that period), regardless of higher 
production, and if important commodities fall 
in price more than the average (relative to a 
fixed base) and remain low in price regardless of 
smaller production (i.e., if in general it is difficult 
to adjust production to prices and if more than 
one year is required for adjustment and in the 
meantime production is increased due to stimulus 
of high prices but not enough to cause a signifi- 
cant decline in price, and production of other 
commodities is curtailed on account of lower 
prices but not enough to cause a significant rise 
in prices) then 


pnt (Pidn) will tend to be smaller, on 
pr account of the weights q,, 
LPdn than 


Pn which will tend to be larger 
pe i ger, 
pr (Pin +3) on account of the weights 


ZPidn4+1 Qn4+1 

1 The effect of individual weights and relatives upon the 
resulting index number should be judged not with respect 
to 100, t.e., whether individual prices are increasing or decreas- 
ing, but with respect to P, i.e., whether individual prices are 
changing more or less rapidly than the average. 
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and 
pPatt (p,q) will tend to equal 1 on account 
Pn of the approximate equality 
“Sha Of dng and fy 
and 
=a will tend to <1 


and 
the fixed base index will tend upward faster than 
the chain index or, in other words, the chain 
index will diverge downward. 

The situation just described was actually 
found to exist for the chain and fixed base index 
numbers of prices for 12 leading crops during 
the years of pronounced divergence of the two 
indexes in 1914-15, 1916-17, and 1918-19.! The 
chain and fixed base indexes of prices of 12 crops 
for the years 1910-19 are as follows: 








Chain Index, .|_ 2220 |_2977 | 1912 | 1913 | r9T4 
Fixed Base 100.0 | 1009.2 93-7 | 110.8 | 100.6 


dindexcnpees 100.0 | 109.2 94.1 | 110.9] 100.8 

















6 8 
Ghai idee oe IgI7 | IQ 1919 


Fixed Base T00.7 | 152.0 | 206.4 |-217.4 | 220.7 
inde xterercsy TO2:4y eG 4.0) metro) 222-222 O55 


























1 See ‘‘Fisher’s Formula for Index Numbers,”’ by Warren 
M. Persons, in the Review of Economic Statistics, May 1921, 
p. 110. 
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In each of the intervals 1914-15, 1916-17, 
1918-19, the prices of important (i.e., heavily 
weighted) commodities that were relatively 
high (with respect to 1910) in the first year of 
the interval remained relatively high in the 
second year of the interval and the prices of 
important commodities that were relatively 
low in the first year remained relatively low in 
the second year of the interval. Also g,41 in 
general was greater than g, for the relatively 
high-priced commodities and q,41 was, in gen- 
eral, less than g, for the low-priced commodities. 

We may further add, in confirmation of the 
generalization, that an actual case of the con- 
verse relationship (to that for the prices of 12 
crops) between the chain and fixed base indexes 
is afforded by Fisher’s 36 commodities. From 
Table 6 we see that the greatest divergence of 
the chain and fixed base indexes was in the in- 
terval 1914-15. In 1914 the “ideal” indexes are 
coincident but in 1915 the fixed base index was 
only 99.89 compared with 100.23 for the chain 
index. That is to say, the chain index diverged 
upward (instead of downward as in the case of 
the prices of 12 crops for 1914-15). Investiga- 
tion of the position of the price relatives of the 
most important commodities in 1914 and 1915 
showed that the high relatives in 1914 did not 
remain high in 1915. Thus, pork, one of the 
most important of Fisher’s 36 commodities, 
shifted its position from 4 per cent above the 
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average to 4 per cent below the average. Thus, 
a condition that was approximately fulfilled for 
the relative prices of 12 crops in 1914-15 failed 
to be fulfilled for important constituents of 
Fisher’s 36 commodities for the same interval. 
We find, in fact, that the chain index for the 36 
commodities diverged upward in 1914-15 instead 
of downward as it did for the prices of 12 crops. 

We may prove, further, that if the chain index 
for prices (computed by formula v, the “‘ideal”’) 
is greater than the fixed base index for prices, 
computed by the same formula, then the correla- 
tive chain index for- quantities will be less 
than the fixed base index for quantities. The 
proof follows: 

Let RY and R® be the ratios of the chain 
indexes to the fixed base indexes of prices and 
quantities, respectively. Then 


RY = { EPGrr.,., Z poy ZPiIn 


) % 
‘i ( = Pn-19n Zpig2 ZPnQr 

Ro =( EquPmr. Ja 

2Gn-1P 2qipe Unf 
and RY. RY =1. 
Since R?)-R@=1= 
(chain index of prices) (chain index of quantities) 

(fixed base index of prices) (fixed base index of quantities) 





and the denominator of the right-hand side of 
the equation equals 2,q,/Zpiqi or the value 
ratio, the numerator will also equal the value 
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ratio. The theorem stated above immediately 
follows. 


(b) The expression for A 





depends, of course, 


upon the formula used for the chain and fixed 
base average. It is hardly worth while to find 
the expression for all of our nine formulas, but 
we shall find it for one other average, formula i. 
The rate of change of the chain index relative to 
the fixed base index for any interval is: 


0 i “(e a 
Rass Pn pi 


= 


KR Ul ( Pati ea 
fi 





where II stands for “‘the product of such terms 


as” and 
W..= (fae Pdr + Lm | 
Sa LPs 


The expression for the rate of change for the 
chain index relative to the fixed base index, using 
geometric means, is derived as follows: 
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Where ~n41= 2, we have 





Pn Wal 
IT (4) Pn Wni-Wa4t1d 
Sear accra, 
Om 
Pi 
Prdn 4 Pi all 
ZPn a Zpin 


oe y4( fates PrnQn +1 —) 
Seer Zhi 


in which W,.3= ee 


Pndn Pnn+1 
Ws es iz (fats - Pefest ) 
Besa ZPnQn UPnQn+1 


If foi, and p, are approximately equal and 
if for those relatives for which ~,>fi, 


Pn +1 > PnQn 
ZPndn +1 ZPndn 


Pn<Pr, Prints << Pits then Ratt win tend to 
ZPndn +1 ne Re 


be < 1 and the chain index will tend to diverge 
downward from the fixed base index. That is to 
say, in such a case the fixed base index will tend 
to move upward with respect to the chain index 
(which in this case moves horizontally) if the 
high relatives (i.e., more than roo per cent) are 
associated with larger quantities in period (n+1) 
than in period (m) and if the low relatives (i.e., 


and for those relatives for which 
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less than roo per cent) are associated with 
smaller quantities in period (m+1) than in 
period (m). If, on the other hand, the high 
relatives are associated with smaller quantities 
in period (n+1) than in period (m) and the low 
relatives are associated with larger quantities in 
period (n+1) than in period () the chain index 
would tend to diverge upward with respect to 
the fixed base index. 

In general, for any values for prices and quan- 
tities the chain index will tend to diverge down- 
ward with respect to the fixed base index, if 


Wout 
a) 
pr <n( Pn ie 


= aye 411 te 
fi 


The chain index will diverge upward from the 
fixed base index if the left-hand side of the above 
expression is greater than the right-hand side. 
We may say for formula i as we did for formula 
v that if, in actual practice, important commodi- 
ties rise in price more than the average (relative 
to a fixed base) and remain high in price in the 
succeeding period regardless of higher production 
and if important commodities fall in price more 
than the average (relative to a fixed base) and 
remain low in price regardless of smaller produc- 
tion (i.e., if in general it is difficult to adjust 
production to prices and if more than one period 
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is required for adjustment and in the meantime 
production is increased due to the stimulus of 
high prices but not enough to cause significant 
decline in price, and production of other com- 
modities is curtailed on account of low prices 
but not enough to cause a significant rise in 
prices) then 


ll Pn Ward will tend to be smaller, on 
pr account of g, in the weights, 
than 
o( Pett Yn+11 which will tend to be larger, 
pr on account of g,41 in the 
weights, 
and 


0 Wntin — will tend to equal 1 on account 
of the approximate equality of 


Pn+i and Pn 
and 


wal will tend to < 1 


n 


and 


the fixed base index will tend upward faster than 
the chain index or, in other words, the chain 
index will diverge downward. 


If a situation leading to the divergence of 
chain and fixed base indexes, such as we have 
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described, persists over several consecutive years 
the two indexes will move farther and farther 
apart. In fact the divergence will cumulate 
according to the law of compound interest. 

Thus, suppose we have three years 1, 2 and 3 
such that the chain index for the third year bears 
the ratio R; to the fixed base index (P31). That 
is to say let 
od Po1+P32 

Psi 


Suppose that the values of p and q of year 1 are 
repeated in year 4, those of year 2 are repeated 
in year 5, and those of year 3 are repeated in 
year 6. Then 
= Post Ps2° Pas? P5a°Pos 

es 


Since, by assumption, P43=P13, Ps4=Po1, Pes 
=Ps, Poi=P 1, and ye 1= 5— we have 


31 


. 2 
R= (eS =(R,)? 


31 


R; 


Rs 


Therefore, if a situation resulting in the di- 
vergence of chain and base index is repeated in 
consecutive periods the two indexes will diverge 
cumulatively in geometric progression. 


From the results presented above it is clear 
that the chain index not only deviates from the 


80 INDEX NUMBERS 


corresponding fixed base index, but the deviation 
is cumulative in accordance with the law of com- 
pound interest, and, in actual practice, the two 
indexes are apt to diverge more and more widely 
as the number of links in the chain increases. 
We conclude that a chain index is an unre- 
liable substitute for a fixed base index and 
should not be used unless the weights are con- 
stant, z.e., unless the index satisfies the circular 
test and the chain and fixed base indexes coin- 
cide. This conclusion indicates another, 7.e., the 
best fixed base index to use for a comparison 
involving more than two periods is either the 
geometric average with fixed weights or the 
aggregative average with fixed weights. 


VI. SumMMARY 


1. The object of the present study is that of 
measuring the average change of prices or quan- 
tities of two or more commodities during two or 
more periods. In other words, our problem is that 
of averaging relatives to secure index numbers. 

2. In attacking this problem we confront at 
the outset the difficulties of (a) the selection of 
weights in cases for which, in actual practice, 
weights and relatives may or may not be corre- 
lated and (0) the selection of an average — from 
the familiar types — which will be adapted to 
the selected method of weighting. 

3. Eleven formulas for index numbers were 
derived, for binary comparisons, in which 
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weights and averages were selected in such a 
way as to eliminate the bias between weights 
and the corresponding relatives. Nine of the 
eleven formulas satisfy the base reversal test. 

4. Averages of relative prices, in order to 
satisfy the base reversal test in general, must 
have fixed weights when the base is reversed. 
That is to say, each relative must appear some- 
where in the formula with the same weight 
when the index is figured forward and backward. 
The logical extension of the base reversal test 
for two periods is the circular test for three or 
more periods. Satisfaction of the circular test by 
an average necessitates the use of fixed weights. 

5. Exact satisfaction of the factor reversal 
test is not of great practical importance in the 
construction of index numbers, and exact satis- 
faction of the base reversal test is not a neces- 
sary condition for the securing of a good index 
number if weights and averages are selected 
in such a way as to avoid weight correlation 
bias. 

6. Chain indexes will, in general, differ from 
corresponding fixed base indexes for the same 
periods on account of differences in weighting 
alone. That is to say, even though the prices 
of all the articles remain fixed during two or 
more periods the chain index computed by any 
formula will, in general, deviate from the fixed 
base index computed according to the same 
formula; further, the source of the deviation is 
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the difference in weights of the chain and fixed 
base indexes. 

7. Between any pair of consecutive periods 
the chain index will increase less rapidly (or 
decrease more rapidly) than the fixed base index 
computed by the same formula if a certain kind 
of relationship or correlation exists between the 
movements of prices and movements of weights 
for those periods. Likewise, the chain index will 
increase more rapidly (or decrease less rapidly) 
than the fixed base index if the converse of the 
relationship or correlation (indicated in the pre- 
ceding statement) exists. 

8. Specifically, if we use formula v, the 
“ideal,” the “certain kind of relationship or 
correlation between the movements of prices and 
movements of weights” that will cause the chain 
index to increase less rapidly in any interval than 
the fixed base index is one in which the price 
relatives of period (m) continue high in period 
(n+1), but quantities increase and/or low price 
relatives of period (m) continue low in period 
(n+1), but quantities decrease so that the weights 
DP: In+1 are positively associated with high rela- 
tives and negatively associated with low rela- 
tives. In such cases (p,¢,) would be less highly 
positively associated with high relatives and more 
highly negatively associated with low relatives. 

g. The situation described in the preceding 
paragraph is apt to occur if the adjustment of 
the quantities to the stimulus of high prices or 
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the depressing effect of low prices are not in fact 
accomplished in the interval between the two 
periods in question. If the adjustments of quan- 
tities to prices take several periods the chain 
index would tend to diverge more and more 
widely downward from the fixed base index. 
This is more apt to be the situation for agricul- 
tural products than for the products of manu- 
facturing and mining. 

to. Even if there were no persisting correla- 
tion between relatives and weights of certain 
commodities, since deviations of the chain index 
from the fixed base index are cumulative “a run 
of luck”’ would at times in actual practice lead 
to a wider and wider divergence of the two in- 
dexes. For this reason alone the chain index is 
“suspect”? and cannot be recommended as a 
reliable form of average. 

11. In general, entirely aside from ‘‘runs of 
luck,” the persistence of economic situations 
over a series of periods makes it likely that the 
chain index will, at times, diverge more and more 
widely (in one direction or the other) from the 
fixed base index. 

12. When there is a significant divergence of 
the chain and fixed base indexes which is pref- 
erable? The answer depends upon the compari- 
son we wish to make. If we desire merely to 
compare the average price of one period with 
that of the preceding period the link of the chain 
is the index to be preferred. If we desire to 
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compare the average price of each year (”) and 
(n+1), respectively, with the base year the fixed 
base index is to be preferred. If our object be 
that of comparing each period (n) and (n+1) 
with the other and with the base simultaneously 
neither the chain indexes nor the fixed base in- 
dexes (with the weights of two years only) is 
satisfactory. 

13. The only way in which the fixed base 
indexes and the chain index can be made to give 
results which are not inconsistent with each 
other for an intercomparison of three or more 
periods is to use fixed or constant weights in 
constructing indexes of average prices for the 
periods to be compared. In other words, satis- 
faction of the “‘circular test” is indicated. 

14. Therefore, for an intercomparison of aver- 
age prices for three or more periods a weighted 
average with fixed weights is to be preferred. 
Such an average should satisfy the base reversal 
test (the geometric or aggregative average) and 
the fixed weights should depend upon the average 
prices and average quantities of all the periods 
in question. Since the weights have, in actual 
practice, less influence on the result than the 
relatives averaged, the method of averaging 
prices and of averaging quantities to secure 
weights is of decidedly less importance than the 
method of averaging the relatives. The aggre- 
gates (or arithmetic averages!) of prices and of 


1 The kind of average selected in obtaining weights, being 
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quantities are, therefore, recommended as the 
factors to be used in securing fixed or constant 
weights. 

15. The decision concerning the base to be 
used depends upon the particular comparison 
desired. If the object is an intercomparison of 
each index with every other one an average base 
period of all the constituent periods is preferred. 
Since constant weights are used the same result 
would be obtained by shifting the base to any 
desired year, by use of a factor, as that which 
would have been obtained by using the desired 
year in the first instance. 

16. We may point out in this connection 
that for the base period the dispersion of price 
relatives is zero and that the choice of base 
determines the dispersion of price relatives for 
other periods. 

17. The best index number of prices or quanti- 
ties for a binary comparison is given by the 
“ideal” formula.! The aggregative average with 
arithmetic mean quantities or prices as weights? 
— depending on whether we are computing an 
index of prices or quantities — however, is a 


of much less significance on the result than the kind of average 
used in computing the price index, greater simplicity and ease 
of computation of one average as compared with others are 
dominating considerations. With adding machines available, 
arithmetic averages are about as easy to ascertain, for instance, 
as medians. 

1 Approved by Fisher, Pigou, Walsh, and Allyn Young. 

2 Approved by Edgeworth, Fisher, Marshall, and Walsh. 
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very close contender for first place and, on 
account of its greater simplicity and ease of com- 
putation, is generally to be preferred in actual 
practice. 

18. The best index number of prices or quanti- 
ties for an intercomparison of three or more peri- 
ods is the aggregative average with fixed weights 
based upon arithmetic mean quantities of prices, 
respectively, for all the periods covered. 

19. Geometric averages of relatives of prices 
or quantities with fixed value weights based on 
arithmetic averages of prices and quantities for 
the periods being compared, two or more, in 
actual practice for a large number of commodi- 
ties for which the weights and relatives are not 
extremely dispersed, conform closely to the 
“ideal” and aggregative indexes described above. 
Problems may arise in which the geometric 
average of relatives is logically indicated and in 
such cases the geometric average of relatives is, 
of course, to be preferred even though the results 
differ substantially from other averages. 

20. When we have adopted a certain base and 
constant quantities for an index which is being 
continued currently we should not think that 
our problem has been settled once and for all. 
The continuity of series of index numbers is 
necessarily interrupted by economic changes. 
New data may make both the base and the 
quantities obsolete. The data, therefore, should 
be constantly watched and the index should be 
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periodically revised to correspond to current 
available data. 

In order to avoid correlation bias between 
weights and relatives in a time series for all possi- 
ble cases of positive, zero, or negative correla- 
tion between relatives and weights it is abso- 
lutely necessary to use fixed or constant weights 
whether the index numbers are computed for 
two periods or for more than two periods. These 
fixed weights measure only approximately the 
relative importance of the different commodities 
in the different years. As new weights become 
available tests should be made of the effect of 
using such new weights. If the effect is signifi- 
cant these new fixed weights should, in general, 
supplant the old weights. 

Comprehensive averages of price changes for 
widely different (from an economic point of 
view) places or periods of time may be impos- 
sible to obtain, not because of the lack of satis- 
factory methods of averaging, but because of the 
lack of homogeneous data to be averaged. In such 
cases we are limited to the comparison of price 
changes of those commodities bought and sold in 
economically significant and comparable amounts 
in the periods compared. 
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